Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models
Published in Forty-Seventh German Conference on Artificial Intelligence, 2024
Malte Luttermann, Ralf Möller, Mattis Hartwig. (2024). "Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models." Proceedings of the Forty-Seventh German Conference on Artificial Intelligence (KI-2024). Springer, Volume 14992, pages 175-189. https://link.springer.com/chapter/10.1007/978-3-031-70893-0_13
Abstract
Probabilistic relational models provide a well-established formalism to combine first-order logic and probabilistic models, thereby allowing to represent relationships between objects in a relational domain. At the same time, the field of artificial intelligence requires increasingly large amounts of relational training data for various machine learning tasks. Collecting real-world data, however, is often challenging due to privacy concerns, data protection regulations, high costs, and so on. To mitigate these challenges, the generation of synthetic data is a promising approach. In this paper, we solve the problem of generating synthetic relational data via probabilistic relational models. In particular, we propose a fully-fledged pipeline to go from relational database to probabilistic relational model, which can then be used to sample new synthetic relational data points from its underlying probability distribution. As part of our proposed pipeline, we introduce a learning algorithm to construct a probabilistic relational model from a given relational database.
Resources for this Paper
BibTeX Citation
@inproceedings{Luttermann2024e,
author = {Malte Luttermann and Ralf Möller and Mattis Hartwig},
title = {{Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models}},
booktitle = {Proceedings of the Forty-Seventh German Conference on Artificial Intelligence (KI-2024)},
year = {2024},
pages = {175--189},
publisher = {Springer},
}